Camera control using system sensor data

ABSTRACT

A method for using cameras in an augmented reality headset is provided. The method includes receiving a signal from a sensor mounted on a headset worn by a user, the signal being indicative of a user intention for capturing an image. The method also includes identifying the user intention for capturing the image, based on a model to classify the signal from the sensor according to the user intention, selecting a first image capturing device in the headset based on a specification of the first image capturing device and the user intention for capturing the image, and capturing the image with the first image capturing device. An augmented reality headset, a memory storing instructions, and a processor to execute the instructions to cause the augmented reality headset to perform the above method are also provided.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present disclosure is related to, and claims priority under 35 U.S.C. § 119(e) to, U.S. Prov. Pat. Appln. No. 63/227,228, entitled CAMERA CONTROL USING SYSTEM SENSOR DATA, filed on Jul. 29, 2021, and to U.S. Prov. Pat. Appln. No. 63/219,266, entitled CAMERA CONTROL USING SYSTEM SENSOR DATA, filed on Jul. 7, 2021, to Sapna SHROFF et al., the contents of which applications are hereby incorporated herein by reference in their entirety, for all purposes.

BACKGROUND

Field

The present disclosure is related to user interfaces in smart glass devices that include one or more cameras for recording images and video. More specifically, the present disclosure is related to methods for automatically selecting, from the one or more cameras of the smart glass device, a camera whose settings are better suited for capturing an image, based on user input and gestures.

Related Art

In today's wearable platforms, the devices include multiple cameras, sensors, and actuators configured to perform multiple ad-hoc functions. In some instances, more than one of these accessory devices may be operational at the same time, while only one of them provides the best use of capabilities for the task at hand. However, these multiple devices typically lack an automated mechanism for activating or de-activating, thereby resulting in cumbersome user interaction to select the devices to be operative for a selected task, or in wasting scarce power resources on a device that is active but not in use.

SUMMARY

In a first embodiment, an augmented reality headset includes a first camera and a second camera mounted on a frame, having a first field of view and a second field of view, respectively, a sensor mounted on the frame, a memory configured to store multiple instructions, and one or more processors configured to execute the instructions to cause the augmented reality headset to perform a method. The method includes processes to: receive a signal from the sensor mounted on the frame, the signal being indicative of an intention of a user for capturing an image, identify the intention of the user for capturing the image based on a model to classify the signal from the sensor according to the intention of the user, select one of the first camera or the second camera based on the first field of view, the second field of view, and the intention of the user for capturing the image, and capture the image with the selected camera.

In a second embodiment, a computer-implemented method includes receiving a signal from a sensor mounted on a headset worn by a user, the signal being indicative of a user intention for capturing an image. The computer-implemented method also includes identifying the user intention for capturing the image, based on a model to classify the signal from the sensor according to the user intention, selecting a first image capturing device in the headset based on a specification of the first image capturing device and the user intention for capturing the image, and capturing the image with the first image capturing device.

In a third embodiment, a non-transitory, computer-readable medium stores instructions which, when executed by one or more processors, cause a computer to execute a method. The method includes receiving a signal from a sensor mounted on a headset worn by a user, the signal being indicative of a user intention for capturing an image. The method also includes identifying the user intention for capturing the image, based on a model to classify the signal from the sensor according to the user intention, selecting a first image capturing device in the headset based on a specification of the first image capturing device and the user intention for capturing the image, and capturing the image with the first image capturing device.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates an architecture including one or more wearable devices coupled to one another, to a mobile device, to a remote server, and to a database, according to some embodiments.

FIG. 2A illustrates a user with a smart glass in a first configuration, according to some embodiments.

FIG. 2B illustrates a user with a smart glass in a second configuration, according to some embodiments.

FIG. 3 is a flow chart illustrating steps in a method for controlling one or more cameras in a smart glass device using multiple sensor data, according to some embodiments.

FIG. 4 is a block diagram illustrating an exemplary computer system with which a headset and methods for use of the same can be implemented, according to some embodiments.

In the figures, elements having the same or similar label number share the same or similar features, unless stated explicitly otherwise.

DETAILED DESCRIPTION

In the following detailed description, numerous specific details are set forth to provide a full understanding of the present disclosure. It will be apparent, however, to one ordinarily skilled in the art, that embodiments of the present disclosure may be practiced without some of these specific details. In other instances, well-known structures and techniques have not been shown in detail so as not to obscure the disclosure. Embodiments as disclosed herein should be considered within the scope of features and other embodiments illustrated in Appendix I, filed concurrently herewith.

A multi-camera or multi-imager smart glass system may include multiple cameras or imagers that may serve different functions. For instance, a system may have cameras that point in different directions so that each covers different portions of a view-sphere. In some embodiments, it is possible to keep all, most, or at least some of the cameras ON at all times, or substantially at all times, or most of the time, or at least for some time, to ensure complete coverage of the view-sphere. However, this consumes power, creates unnecessary data and subsequent data-management overhead, and is generally inefficient. Hence, it is desirable to control the system efficiently, by triggering capture from only those cameras that would view relevant activity.

Manually controlling such a system by directly controlling and triggering or switching each camera ON-OFF based on view and relevant activity moment-by-moment could be an inconvenient and disruptive experience for a user. Hence, there is a need for system-level auto-detection of relevant activity to use for efficient and unsupervised/minimally-supervised camera control.

We propose a system-level method to detect relevant activity to use for efficient and unsupervised/minimally-supervised camera control. We consider an embodiment of a multi-view system, with multiple cameras capturing different portions of a view-sphere. In some embodiments, the system is built into or attached to a pair of spectacles.

A system as disclosed herein uses sensors on the device, such as camera image streams, hand gestures, IMU data, audio cues, or a few active button presses, which can be used alone or in combination to estimate where the relevant activity may be occurring in the user's view-sphere. Eye tracking sensors may be used to determine the gaze of the user. IMU data may be used to determine the head angle, indicating whether the user is looking ahead or looking downwards. Hand gestures captured by a camera stream may indicate the current region of interest. Audio cues may signal, for instance, the name of a person that needs to be captured. A camera stream may be used to detect and recognize or track the relevant person in the field of view. Location and context sensing may be used to establish relevance of activity. These data could be used singly or in combination to estimate the region of relevance. The system may apply use-case-prioritized models or learn appropriate models with machine learning/deep learning from various use situations to arrive at good estimates. Once the relevant region is identified, the system could trigger events required for the appropriate camera, such as appropriate exposure/other control and full capture.
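As an illustration of how such cues might be combined, the sketch below folds a few of the sensor signals described above (head pitch from an IMU, gaze pitch from an eye tracker, a hand detected in the lower view-sphere, and an audio keyword) into a single estimate of the relevant region. The field names, thresholds, and weights are hypothetical stand-ins for a use-case-prioritized or learned model; this is a minimal sketch under those assumptions, not the disclosed implementation.

```python
# Illustrative sketch: fusing sensor cues into an estimate of the relevant
# region. Field names, thresholds, and weights are hypothetical stand-ins
# for a use-case-prioritized or learned model.
from dataclasses import dataclass
from typing import Optional

@dataclass
class SensorCues:
    head_pitch_deg: float         # from the IMU: 0 = level, negative = looking down
    gaze_pitch_deg: float         # from the eye-tracking sensor
    hand_in_lower_view: bool      # from a camera stream / hand-gesture detector
    voice_keyword: Optional[str]  # from audio cues, e.g. a spoken command

def estimate_region_of_relevance(cues: SensorCues) -> str:
    """Return 'forward' or 'downward' as the estimated region of relevance."""
    score_down = 0.0
    if cues.head_pitch_deg < -25:        # head tilted toward the hands
        score_down += 0.4
    if cues.gaze_pitch_deg < -20:        # gaze directed downward
        score_down += 0.3
    if cues.hand_in_lower_view:          # hands detected in the lower view-sphere
        score_down += 0.3
    if cues.voice_keyword in ("hands", "close-up"):
        score_down += 0.2
    return "downward" if score_down >= 0.5 else "forward"

# Example: head and gaze down, hands visible -> the downward region is relevant.
print(estimate_region_of_relevance(SensorCues(
    head_pitch_deg=-35, gaze_pitch_deg=-30,
    hand_in_lower_view=True, voice_keyword=None)))
```

In practice, the hand-tuned weights above would be replaced by a classifier trained on labeled use situations, as discussed in the preceding paragraph.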

For example, a user may want to capture their view facing distant scenery with a camera that faces forward, and perhaps has a large field of view. When the user looks down towards their hands, say, as they frost cookies or perform other detailed manual work, they may need to switch to a different, downward-facing camera with a different field of view and possibly different image quality (for example, a narrower field of view with a higher resolution).

FIG. 1 illustrates an architecture 10 including one or more wearable devices 100-1 (e.g., a smart glass) and 100-2 (e.g., a smart watch) coupled to one another (hereinafter, collectively referred to as “wearable devices 100”), to a mobile device 110, a remote server 130, and to a database 152, according to some embodiments. Smart glass 100-1 may be configured for AR/VR applications, and mobile device 110 may be a smart phone, all of which may communicate with one another via wireless communications and exchange a first dataset 103-1. Dataset 103-1 may include a recorded video, audio, or some other file or streaming media. A user 101 of wearable devices 100 is also the owner of, or is associated with, mobile device 110. In some embodiments, the smart glass may directly communicate with the remote server, the database, or any other client device (e.g., a smart phone of a different user, and the like) via the network. The mobile device may be communicatively coupled with a remote server and a database via a network 150, and transmit/share information, files, and the like with one another, e.g., dataset 103-2 and dataset 103-3 (hereinafter, collectively referred to as “datasets 103”). Network 150 may include, for example, any one or more of a local area network (LAN), a wide area network (WAN), the Internet, and the like. Further, the network can include, but is not limited to, any one or more of the following network topologies, including a bus network, a star network, a ring network, a mesh network, a star-bus network, a tree or hierarchical network, and the like.

Smart glass 100-1 may include a frame 105 including eyepieces 107 to provide an image to user 101. A camera 115 (e.g., forward-looking) is mounted on frame 105, and has a field of view (FOV). A sensing device 128 facing the user is configured to track a pupil position of the user. Processor 112 is configured to identify a region of interest (ROI) within the image viewed by user 101. An interface device 129 indicates to user 101 that the FOV of camera 115 at least partially misses the ROI of the user. In some embodiments, smart glass 100-1 may also include a haptic actuator 125 to recreate a sense of touch to the user, for a VR/AR application, and a speaker 127 to communicate to user 101 voice or sound signals indicative of adjusting a gaze direction for improving the FOV of camera 115 (e.g., obtained with pupil tracking information from sensing device 128). For example, in some embodiments, haptic actuator 125 may include a vibrating component to indicate to the user to nudge their head position in a desired direction to align the FOV of forward-looking camera 115 with the ROI, or to confirm to the user that the FOV is properly centered on the ROI.
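The following sketch illustrates the FOV/ROI check that could drive interface device 129. It assumes, purely for illustration, that the ROI (from pupil tracking) and the camera FOV are expressed as yaw/pitch angles in a shared head-centered frame, and it returns the offset that a haptic or audio nudge would need to correct; the names and values are hypothetical, not the disclosed implementation.

```python
# Illustrative sketch: does the user's region of interest (from pupil
# tracking) fall inside the camera's field of view, and if not, in which
# direction should the user be nudged? All angles (degrees) and names are
# hypothetical; a real system would rely on calibration data such as that
# stored in memory 120.
def roi_inside_fov(roi_yaw, roi_pitch, fov_center_yaw, fov_center_pitch,
                   fov_half_width, fov_half_height):
    """Return (inside, (yaw_offset, pitch_offset)) in a head-centered frame."""
    yaw_off = roi_yaw - fov_center_yaw
    pitch_off = roi_pitch - fov_center_pitch
    inside = abs(yaw_off) <= fov_half_width and abs(pitch_off) <= fov_half_height
    return inside, (yaw_off, pitch_off)

inside, (dyaw, dpitch) = roi_inside_fov(roi_yaw=45, roi_pitch=-5,
                                        fov_center_yaw=0, fov_center_pitch=0,
                                        fov_half_width=35, fov_half_height=25)
if not inside:
    # e.g., drive haptic actuator 125 or speaker 127 to suggest a head turn
    print(f"nudge head by yaw={dyaw:+.0f} deg, pitch={dpitch:+.0f} deg")
```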

In some embodiments, smart glass 100-1 may include multiple sensors 121 such as IMUs, gyroscopes, microphones, and capacitive sensors configured as touch interfaces for the user. Other touch sensors may include a pressure sensor, a thermometer, and the like.

In addition, wearable devices 100, or mobile device 110, may include a memory circuit 120 storing instructions, and a processor circuit 112 configured to execute the instructions to cause smart glass 100-1 to perform, at least partially, some of the steps in methods consistent with the present disclosure. Memory circuit 120 may also store data, such as calibration data for the position and orientation of camera 115 relative to the FOV of the user. In some embodiments, smart glass 100-1, mobile device 110, server 130, and/or database 152 may further include a communications module 118 enabling the device to wirelessly communicate with remote server 130 via network 150. Smart glass 100-1 may thus download multimedia online content (e.g., dataset 103-1) from remote server 130, to perform at least partially some of the operations in methods as disclosed herein. In some embodiments, memory 120 may include instructions to cause processor 112 to receive and combine signals from sensors 121, avoid false positives, and better assess user intentions and commands when an input signal is received from a user interface.
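One way memory 120 could combine signals from sensors 121 to avoid false positives is to require both agreement between independent cues and persistence of the same decision over a short window before acting on an intention. The window length and the two-cue rule in the sketch below are illustrative assumptions, not part of the disclosure.

```python
# Illustrative sketch: debouncing sensor cues to avoid false positives. A
# tentative intention is acted on only if at least two independent cues agree
# and the same label persists across a short window of samples.
from collections import deque

class IntentionDebouncer:
    def __init__(self, window=5, min_agreement=2):
        self.history = deque(maxlen=window)
        self.min_agreement = min_agreement

    def update(self, cue_votes):
        """cue_votes maps a cue name (e.g., 'imu', 'gaze') to a label or None."""
        labels = [v for v in cue_votes.values() if v is not None]
        candidate = None
        for label in set(labels):                        # per-sample agreement
            if labels.count(label) >= self.min_agreement:
                candidate = label
        self.history.append(candidate)
        if (len(self.history) == self.history.maxlen     # persistence over window
                and candidate is not None
                and all(h == candidate for h in self.history)):
            return candidate
        return None  # not confident yet; keep the current camera state

debouncer = IntentionDebouncer(window=3)
for _ in range(3):
    decision = debouncer.update({"imu": "look_down", "gaze": "look_down", "audio": None})
print(decision)  # -> look_down after three consistent samples
```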

FIGS. 2A-2B illustrate two configurations with a user 201 wearing smart glass 200. Smart glass 200 includes two cameras 215A and 215B (hereinafter, collectively referred to as “cameras 215”). Cameras 215 each have a field of view, FOV 220A and 220B, respectively (hereinafter, collectively referred to as “FOVs 220”). FOVs 220 are generally different, and their characteristics depend on the specifications of cameras 215. For example, FOV 220A points straight ahead of the face of user 201, and is wider than FOV 220B, which points straight down, close to the user's body.

FIG. 2A illustrates a user 201 with smart glass 200 in a first configuration, according to some embodiments. Accordingly, user 201 may be focused on an item straight ahead of her, and therefore, first camera 215A (at the top of the user's right eyepiece) may be better suited to capture an image of an object of interest within a user FOV 200A. A system as disclosed herein then selects to activate camera 215A as illustrated, having an angle of view straight ahead of the user, and a wide field of view 220A.

FIG. 2B illustrates user 201 with smart glass 200 in a second configuration, according to some embodiments. Accordingly, in this second configuration, user 201 may be focused on an item 230 within narrower field of view 220B, straight below the user's face, at the level of the user's hands. In this case, a system as disclosed herein may select to activate a second camera 215B, at the top of the user's left eyepiece, which may be configured with a downward-looking, narrower field of view 220B (in comparison with the first camera).

In some embodiments, the system is configured to automatically switch between the first camera and the second camera as the user switches attitude and pose, without need for a user input. In some embodiments, a certain degree of user input may be desirable, when there is ambiguity between different user gestures, or when two or more cameras have competing specifications in relation to the object of interest for the user.
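A simple way to make such switching stable is hysteresis: two different pitch thresholds so that small pose changes near the boundary do not toggle cameras, with an ambiguous middle band in which the current selection is kept until further cues, or a light user input, resolve the choice. The thresholds and labels in the following sketch are illustrative assumptions, not values from the disclosure.

```python
# Illustrative sketch: automatic camera switching with hysteresis. Head-pitch
# thresholds and camera labels are assumptions made for this example only.
DOWN_THRESHOLD = -30.0   # switch to the downward camera below this pitch
UP_THRESHOLD = -15.0     # switch back to the forward camera above this pitch

def select_camera(head_pitch_deg, current):
    """current is 'forward' or 'downward'; in the ambiguous band, keep it."""
    if head_pitch_deg <= DOWN_THRESHOLD:
        return "downward"
    if head_pitch_deg >= UP_THRESHOLD:
        return "forward"
    return current  # between thresholds: await further cues or a user input

state = "forward"
for pitch in (-5, -20, -35, -25, -10):
    state = select_camera(pitch, state)
    print(f"pitch {pitch:+} deg -> {state} camera")
```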

FIG. 3 is a flow chart illustrating steps in a method 300 for controlling one or more cameras in a smart glass device using multiple sensor data (e.g., smart glasses 100 and 200, and sensors 121), according to some embodiments. The smart glass may also include one or more cameras, a sensing device, a microphone, a speaker, and a haptic actuator mounted on a frame (e.g., camera 115, sensing device 128, interface device 129, speaker 127, and haptic actuator 125). The smart glass may also include a communications module 118 to transmit and receive datasets with a mobile device or a server, through a network, while performing one or more steps in method 300 (e.g., communications module 118, client device 110, server 130, datasets 103, and network 150). In embodiments consistent with the present disclosure, at least one of the steps in method 300 may be performed by a processor executing instructions stored in a memory circuit (e.g., processor 112, memory 120). In some embodiments, a method consistent with the present disclosure may include one or more of the steps in method 300 performed in a different order, simultaneously, quasi-simultaneously, or overlapping in time.

Step 302 includes receiving a signal from a sensor mounted on a smart glass worn by a user, the signal being indicative of a user intention. In some embodiments, step 302 includes one of receiving an inertial signal from an inertial motion sensor, receiving a sound capture from a user voice, receiving a hand gesture, or receiving an active button press. In some embodiments, step 302 includes identifying a hand gesture of the user indicative of an object of interest. In some embodiments, step 302 includes receiving a pupil position of the user from an eye tracking device mounted on the headset. In some embodiments, step 302 includes identifying an orientation of the headset, and selecting the first image capturing device comprises selecting a camera whose field of view is aligned with the orientation of the headset.

Step 304 includes identifying the user intention based on a model to classify the signal from the sensor according to the user intention.

Step 306 includes selecting a first image capturing device in the smart glass based on a specification of the first image capturing device and the user intention. In some embodiments, step 306 includes selecting the first image capturing device when a field of view of the first image capturing device includes a point of interest in a field of view for the user, in the smart glass. In some embodiments, step 306 includes de-activating the first image capturing device in the smart glass when the user intention is incompatible with the specification of the image capturing device. In some embodiments, step 306 includes selecting, from one or more image capturing devices in the smart glass, the image capturing device whose specification best matches the user intention. In some embodiments, step 306 includes selecting a second image capturing device and de-activating the first image capturing device based on a second user intention. In some embodiments, the specification of the first image capturing device is a field of view, and step 306 includes verifying that the field of view includes an object of interest identified within the user intention.
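A minimal sketch of this selection logic is shown below, under the assumption that each device's specification is summarized by a field-of-view center and half-angle: the device whose field of view contains the point of interest is selected, and a narrower field of view (suggesting higher angular resolution) breaks ties. The class, names, and values are hypothetical illustrations.

```python
# Illustrative sketch of step 306: each device's specification is reduced to a
# field-of-view center and half-angle (hypothetical values), and the device
# whose field of view contains the point of interest is selected, preferring
# the narrower field of view when several devices qualify.
from dataclasses import dataclass

@dataclass
class CameraSpec:
    name: str
    fov_center_pitch_deg: float
    fov_half_angle_deg: float

def select_device(specs, poi_pitch_deg):
    candidates = [s for s in specs
                  if abs(poi_pitch_deg - s.fov_center_pitch_deg) <= s.fov_half_angle_deg]
    if not candidates:
        return None  # no device covers the point of interest; skip or de-activate
    return min(candidates, key=lambda s: s.fov_half_angle_deg)

specs = [CameraSpec("forward camera 215A", 0.0, 40.0),
         CameraSpec("downward camera 215B", -50.0, 20.0)]
print(select_device(specs, poi_pitch_deg=-45.0).name)  # -> downward camera 215B
```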

Step 308 includes capturing the image with the first image capturing device.
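Putting steps 302-308 together, the sketch below wires a sensor reading, an intention classifier, and a set of candidate cameras into the control flow of method 300. Every class is a hypothetical stub standing in for components described above (a sensor 121, the intention-classification model, and cameras 215); the sketch only shows the sequence of operations, not a definitive implementation.

```python
# Illustrative end-to-end sketch of method 300. All classes are hypothetical
# stubs; only the control flow of steps 302-308 is shown.
class StubSensor:
    def read(self):
        return {"head_pitch_deg": -40.0}      # pretend IMU reading

class StubModel:
    def classify(self, signal):               # step 304: map signal to intention
        return "capture_hands" if signal["head_pitch_deg"] < -30 else "capture_scene"

class StubCamera:
    def __init__(self, name, intentions):
        self.name, self.intentions, self.active = name, intentions, False
    def matches(self, intention):              # compare specification and intention
        return intention in self.intentions
    def deactivate(self):
        self.active = False
    def capture(self):
        self.active = True
        return f"image from {self.name}"

def method_300(sensor, model, cameras):
    signal = sensor.read()                     # step 302: receive the sensor signal
    intention = model.classify(signal)         # step 304: identify the user intention
    device = next((c for c in cameras if c.matches(intention)), None)  # step 306
    if device is None:
        return None                            # no compatible device; skip capture
    for other in cameras:
        if other is not device:
            other.deactivate()                 # keep only the selected camera active
    return device.capture()                    # step 308: capture the image

cameras = [StubCamera("forward camera 215A", {"capture_scene"}),
           StubCamera("downward camera 215B", {"capture_hands"})]
print(method_300(StubSensor(), StubModel(), cameras))  # -> image from downward camera 215B
```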

The subject technology is illustrated, for example, according to various aspects described below. Various examples of aspects of the subject technology are described as numbered claims (claim 1, 2, etc.) for convenience. These are provided as examples, and do not limit the subject technology.

In one aspect, a method may be an operation, an instruction, or a function and vice versa. In one aspect, a claim may be amended to include some or all of the words (e.g., instructions, operations, functions, or components) recited in either one or more claims, one or more words, one or more sentences, one or more phrases, one or more paragraphs, and/or one or more claims.

Hardware Overview

FIG. 4 is a block diagram illustrating an exemplary computer system 400 with which the headset 100 of FIG. 1, and method 300 can be implemented, according to some embodiments. In certain aspects, computer system 400 may be implemented using hardware or a combination of software and hardware, either in a dedicated server, or integrated into another entity, or distributed across multiple entities. Computer system 400 may include a desktop computer, a laptop computer, a tablet, a phablet, a smartphone, a feature phone, a server computer, or otherwise. A server computer may be located remotely in a data center or be stored locally.

Computer system 400 includes a bus 408 or other communication mechanism for communicating information, and a processor 402 (e.g., processor 112) coupled with bus 408 for processing information. By way of example, the computer system 400 may be implemented with one or more processors 402. Processor 402 may be a general-purpose microprocessor, a microcontroller, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA), a Programmable Logic Device (PLD), a controller, a state machine, gated logic, discrete hardware components, or any other suitable entity that can perform calculations or other manipulations of information.

Computer system 400 can include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, or a combination of one or more of them stored in an included memory 404 (e.g., memory 120), such as a Random Access Memory (RAM), a flash memory, a Read-Only Memory (ROM), a Programmable Read-Only Memory (PROM), an Erasable PROM (EPROM), registers, a hard disk, a removable disk, a CD-ROM, a DVD, or any other suitable storage device, coupled with bus 408 for storing information and instructions to be executed by processor 402. The processor 402 and the memory 404 can be supplemented by, or incorporated in, special purpose logic circuitry.

The instructions may be stored in the memory 404 and implemented in one or more computer program products, e.g., one or more modules of computer program instructions encoded on a computer-readable medium for execution by, or to control the operation of, the computer system 400, and according to any method well known to those of skill in the art, including, but not limited to, computer languages such as data-oriented languages (e.g., SQL, dBase), system languages (e.g., C, Objective-C, C++, Assembly), architectural languages (e.g., Java, .NET), and application languages (e.g., PHP, Ruby, Perl, Python). Instructions may also be implemented in computer languages such as array languages, aspect-oriented languages, assembly languages, authoring languages, command line interface languages, compiled languages, concurrent languages, curly-bracket languages, dataflow languages, data-structured languages, declarative languages, esoteric languages, extension languages, fourth-generation languages, functional languages, interactive mode languages, interpreted languages, iterative languages, list-based languages, little languages, logic-based languages, machine languages, macro languages, metaprogramming languages, multiparadigm languages, numerical analysis, non-English-based languages, object-oriented class-based languages, object-oriented prototype-based languages, off-side rule languages, procedural languages, reflective languages, rule-based languages, scripting languages, stack-based languages, synchronous languages, syntax handling languages, visual languages, wirth languages, and xml-based languages. Memory 404 may also be used for storing temporary variables or other intermediate information during execution of instructions to be executed by processor 402.

A computer program as discussed herein does not necessarily correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, subprograms, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network. The processes and logic flows described in this specification can be performed by one or more programmable processors executing one or more computer programs to perform functions by operating on input data and generating output.

Computer system 400 further includes a data storage device 406 such as a magnetic disk or optical disk, coupled with bus 408 for storing information and instructions. Computer system 400 may be coupled via input/output module 410 to various devices. Input/output module 410 can be any input/output module. Exemplary input/output modules 410 include data ports such as USB ports. The input/output module 410 is configured to connect to a communications module 412. Exemplary communications modules 412 include networking interface cards, such as Ethernet cards and modems. In certain aspects, input/output module 410 is configured to connect to a plurality of devices, such as an input device 414 and/or an output device 416. Exemplary input devices 414 include a keyboard and a pointing device, e.g., a mouse or a trackball, by which a consumer can provide input to the computer system 400. Other kinds of input devices 414 can be used to provide for interaction with a consumer as well, such as a tactile input device, visual input device, audio input device, or brain-computer interface device. For example, feedback provided to the consumer can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the consumer can be received in any form, including acoustic, speech, tactile, or brain wave input. Exemplary output devices 416 include display devices, such as an LCD (liquid crystal display) monitor, for displaying information to the consumer.

According to one aspect of the present disclosure, wearable devices 100 can be implemented, at least partially, using a computer system 400 in response to processor 402 executing one or more sequences of one or more instructions contained in memory 404. Such instructions may be read into memory 404 from another machine-readable medium, such as data storage device 406. Execution of the sequences of instructions contained in main memory 404 causes processor 402 to perform the process steps described herein. One or more processors in a multi-processing arrangement may also be employed to execute the sequences of instructions contained in memory 404. In alternative aspects, hard-wired circuitry may be used in place of or in combination with software instructions to implement various aspects of the present disclosure. Thus, aspects of the present disclosure are not limited to any specific combination of hardware circuitry and software.

Various aspects of the subject matter described in this specification can be implemented in a computing system that includes a back end component, e.g., a data server, or that includes a middleware component, e.g., an application server, or that includes a front end component, e.g., a client computer having a graphical consumer interface or a Web browser through which a consumer can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back end, middleware, or front end components. The components of the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. The communication network (e.g., network 150) can include, for example, any one or more of a LAN, a WAN, the Internet, and the like. Further, the communication network can include, but is not limited to, for example, any one or more of the following network topologies, including a bus network, a star network, a ring network, a mesh network, a star-bus network, a tree or hierarchical network, or the like. The communications modules can be, for example, modems or Ethernet cards.

Computer system 400 can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. Computer system 400 can be, for example, and without limitation, a desktop computer, laptop computer, or tablet computer. Computer system 400 can also be embedded in another device, for example, and without limitation, a mobile telephone, a PDA, a mobile audio player, a Global Positioning System (GPS) receiver, a video game console, and/or a television set top box.

The term “machine-readable storage medium” or “computer-readable medium” as used herein refers to any medium or media that participates in providing instructions to processor 402 for execution. Such a medium may take many forms, including, but not limited to, non-volatile media, volatile media, and transmission media. Non-volatile media include, for example, optical or magnetic disks, such as data storage device 406. Volatile media include dynamic memory, such as memory 404. Transmission media include coaxial cables, copper wire, and fiber optics, including the wires forming bus 408. Common forms of machine-readable media include, for example, a floppy disk, a flexible disk, a hard disk, magnetic tape, any other magnetic medium, a CD-ROM, a DVD, any other optical medium, punch cards, paper tape, any other physical medium with patterns of holes, a RAM, a PROM, an EPROM, a FLASH EPROM, any other memory chip or cartridge, or any other medium from which a computer can read. The machine-readable storage medium can be a machine-readable storage device, a machine-readable storage substrate, a memory device, a composition of matter affecting a machine-readable propagated signal, or a combination of one or more of them.

To illustrate the interchangeability of hardware and software, items such as the various illustrative blocks, modules, components, methods, operations, instructions, and algorithms have been described generally in terms of their functionality. Whether such functionality is implemented as hardware, software, or a combination of hardware and software depends upon the particular application and design constraints imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application.

As used herein, the phrase “at least one of” preceding a series of items, with the terms “and” or “or” to separate any of the items, modifies the list as a whole, rather than each member of the list (e.g., each item). The phrase “at least one of” does not require selection of at least one item; rather, the phrase allows a meaning that includes at least one of any one of the items, and/or at least one of any combination of the items, and/or at least one of each of the items. By way of example, the phrases “at least one of A, B, and C” or “at least one of A, B, or C” each refer to only A, only B, or only C; any combination of A, B, and C; and/or at least one of each of A, B, and C.

The word “exemplary” is used herein to mean “serving as an example, instance, or illustration.” Any embodiment described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other embodiments. Phrases such as an aspect, the aspect, another aspect, some aspects, one or more aspects, an implementation, the implementation, another implementation, some implementations, one or more implementations, an embodiment, the embodiment, another embodiment, some embodiments, one or more embodiments, a configuration, the configuration, another configuration, some configurations, one or more configurations, the subject technology, the disclosure, the present disclosure, other variations thereof and alike are for convenience only and do not imply that a disclosure relating to such phrase(s) is essential to the subject technology or that such disclosure applies to all configurations of the subject technology. A disclosure relating to such phrase(s) may apply to all configurations, or one or more configurations. A disclosure relating to such phrase(s) may provide one or more examples. A phrase such as an aspect or some aspects may refer to one or more aspects and vice versa, and this applies similarly to other foregoing phrases.

A reference to an element in the singular is not intended to mean “one and only one” unless specifically stated, but rather “one or more.” Pronouns in the masculine (e.g., his) include the feminine and neuter gender (e.g., her and its) and vice versa. The term “some” refers to one or more. Underlined and/or italicized headings and subheadings are used for convenience only, do not limit the subject technology, and are not referred to in connection with the interpretation of the description of the subject technology. Relational terms such as first and second and the like may be used to distinguish one entity or action from another without necessarily requiring or implying any actual such relationship or order between such entities or actions. All structural and functional equivalents to the elements of the various configurations described throughout this disclosure that are known or later come to be known to those of ordinary skill in the art are expressly incorporated herein by reference and intended to be encompassed by the subject technology. Moreover, nothing disclosed herein is intended to be dedicated to the public regardless of whether such disclosure is explicitly recited in the above description. No claim element is to be construed under the provisions of 35 U.S.C. § 112, sixth paragraph, unless the element is expressly recited using the phrase “means for” or, in the case of a method claim, the element is recited using the phrase “step for.”

While this specification contains many specifics, these should not be construed as limitations on the scope of what may be described, but rather as descriptions of particular implementations of the subject matter. Certain features that are described in this specification in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination. Moreover, although features may be described above as acting in certain combinations and even initially described as such, one or more features from a described combination can, in some cases, be excised from the combination, and the described combination may be directed to a subcombination or variation of a subcombination.

The subject matter of this specification has been described in terms of particular aspects, but other aspects can be implemented and are within the scope of the following claims. For example, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. The actions recited in the claims can be performed in a different order and still achieve desirable results. As one example, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In certain circumstances, multitasking and parallel processing may be advantageous. Moreover, the separation of various system components in the aspects described above should not be understood as requiring such separation in all aspects, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products.

The title, background, drawings, and description thereof are hereby incorporated into the disclosure and are provided as illustrative examples of the disclosure, not as restrictive descriptions. It is submitted with the understanding that they will not be used to limit the scope or meaning of the claims. In addition, in the detailed description, it can be seen that the description provides illustrative examples, and the various features are grouped together in various implementations for the purpose of streamlining the disclosure. The method of disclosure is not to be interpreted as reflecting an intention that the described subject matter requires more features than are expressly recited in each claim. Rather, as the claims reflect, inventive subject matter lies in less than all features of a single disclosed configuration or operation. The claims are hereby incorporated into the detailed description, with each claim standing on its own as a separately described subject matter.

The claims are not intended to be limited to the aspects described herein but are to be accorded the full scope consistent with the language of the claims and to encompass all legal equivalents. Notwithstanding, none of the claims are intended to embrace subject matter that fails to satisfy the requirements of the applicable patent law, nor should they be interpreted in such a way.

What is claimed is:
1. A computer-implemented method, comprising: receiving a signal from a sensor mounted on a headset worn by a user, the signal being indicative of a user intention for capturing an image; identifying the user intention for capturing the image, based on a model to classify the signal from the sensor according to the user intention; selecting a first image capturing device in the headset based on a specification of the first image capturing device and the user intention for capturing the image; and capturing the image with the first image capturing device.
2. The computer-implemented method of claim 1, wherein receiving a signal from a sensor comprises one of receiving an inertial signal from an inertial motion sensor, receiving a sound capture from a user voice, receiving a hand gesture, or receiving an active button press.
3. The computer-implemented method of claim 1, wherein selecting the first image capturing device based on a specification of the first image capturing device comprises selecting the first image capturing device when a field of view of the first image capturing device includes a point of interest in a field of view for the user, in the headset.

4. The computer-implemented method of claim 1, further comprising de-activating the first image capturing device in the headset when the user intention is incompatible with the specification of the image capturing device.
5. The computer-implemented method of claim 1, further comprising selecting, from one or more image capturing devices in the headset, the image capturing device whose specification best matches the user intention.
6. The computer-implemented method of claim 1, further comprising selecting a second image capturing device and de-activating the first image capturing device based on a second user intention.

7. The computer-implemented method of claim 1, wherein the specification of the first image capturing device is a field of view, and wherein selecting the first image capturing device comprises verifying that the field of view includes an object of interest identified within the user intention.
8. The computer-implemented method of claim 1, wherein receiving a signal from a sensor mounted on a headset comprises identifying a hand gesture of the user indicative of an object of interest.
9. The computer-implemented method of claim 1, wherein receiving a signal from a sensor comprises receiving a pupil position of the user from an eye tracking device mounted on the headset.
10. The computer-implemented method of claim 1, wherein receiving a signal from a sensor comprises identifying an orientation of the headset, and selecting the first image capturing device comprises selecting a camera whose field of view points along the orientation of the headset.
11. An augmented reality headset, comprising: a first camera and a second camera mounted on a frame, having a first field of view and a second field of view, respectively; a sensor mounted on the frame; a memory configured to store multiple instructions; and one or more processors configured to execute the instructions to cause the augmented reality headset to: receive a signal from the sensor mounted on the frame, the signal being indicative of an intention of a user for capturing an image, identify the intention of the user for capturing the image, based on a model to classify the signal from the sensor according to the intention of the user, select one of the first camera or the second camera based on the first field of view, the second field of view, and the intention of the user for capturing the image, and capture the image with a selected camera.
12. The augmented reality headset of claim 11, wherein the sensor is an inertial motion unit, and to receive a signal from a sensor comprises identifying an orientation of the augmented reality headset relative to a fixed coordinate system.
13. The augmented reality headset of claim 11, wherein to select one of the first camera or the second camera the one or more processors execute instructions to select the first camera when a field of view of the first camera includes a point of interest in a field of view for the user, in the augmented reality headset.
14. The augmented reality headset of claim 11, wherein the one or more processors execute instructions to de-activate the first camera when the second camera is selected, based on a second user intention.

15. The augmented reality headset of claim 11, wherein the sensor provides a signal indicative of a hand gesture of the user pointing to an object of interest.
16. The augmented reality headset of claim 11, wherein the sensor is an eye tracking device mounted on the frame, configured to provide a signal indicative of a pupil position of the user.
17. The augmented reality headset of claim 11, wherein the sensor is an inertial motion sensor configured to provide a signal indicative of an orientation of the augmented reality headset, and the one or more processors execute instructions to select the first camera when the first field of view is aligned with the orientation of the augmented reality headset.
18. The augmented reality headset of claim 11, wherein the sensor is one of the first camera or the second camera, configured to capture a gesture of the user indicative of an object of interest.

19. The augmented reality headset of claim 11, wherein the sensor is a microphone configured to collect and identify a voice command indicative of the intention of the user.
20. The augmented reality headset of claim 11, wherein the sensor is a touch-sensitive sensor configured to receive a touch command from the user.